04:39
2026-06-14
twitter.com
ai-agents
Initial Results on Legal Agent Benchmark
Gabe Pereyra released the Legal Agent Benchmark (LAB), an open-source benchmark for evaluating AI agents on complex legal tasks, and shared initial results on frontier model performance in long-horizo…